在本文中,我们介绍了西班牙语中的第一个系统,能够回答有关人类使用药物的问题,称为Meqa(药物问题回答),由西班牙药品和健康产品(AEMPS,以西班牙语的首字母缩略词为本)创建的项目。提供医疗帮助的在线服务大大增殖,主要是由于Covid-19由于目前的大流行情况。例如,诸如Doctoralia,Savia或Saludonnet等网站提供医生答案类型咨询,其中患者或用户可以向医生和专家发送问题,并在不到24小时内接收答案。收到的许多问题与人类使用的药物有关,大多数都可以通过传单回答。因此,能够自动回答这些类型问题的MEQA等系统可以减轻这些网站的负担,并且对这些患者来说是很好的用途。
translated by 谷歌翻译
Spacecraft pose estimation is a key task to enable space missions in which two spacecrafts must navigate around each other. Current state-of-the-art algorithms for pose estimation employ data-driven techniques. However, there is an absence of real training data for spacecraft imaged in space conditions due to the costs and difficulties associated with the space environment. This has motivated the introduction of 3D data simulators, solving the issue of data availability but introducing a large gap between the training (source) and test (target) domains. We explore a method that incorporates 3D structure into the spacecraft pose estimation pipeline to provide robustness to intensity domain shift and we present an algorithm for unsupervised domain adaptation with robust pseudo-labelling. Our solution has ranked second in the two categories of the 2021 Pose Estimation Challenge organised by the European Space Agency and the Stanford University, achieving the lowest average error over the two categories.
translated by 谷歌翻译
Privacy-preserving machine learning in data-sharing processes is an ever-critical task that enables collaborative training of Machine Learning (ML) models without the need to share the original data sources. It is especially relevant when an organization must assure that sensitive data remains private throughout the whole ML pipeline, i.e., training and inference phases. This paper presents an innovative framework that uses Representation Learning via autoencoders to generate privacy-preserving embedded data. Thus, organizations can share the data representation to increase machine learning models' performance in scenarios with more than one data source for a shared predictive downstream task.
translated by 谷歌翻译
In this work, we propose a framework relying solely on chat-based customer support (CS) interactions for predicting the recommendation decision of individual users. For our case study, we analyzed a total number of 16.4k users and 48.7k customer support conversations within the financial vertical of a large e-commerce company in Latin America. Consequently, our main contributions and objectives are to use Natural Language Processing (NLP) to assess and predict the recommendation behavior where, in addition to using static sentiment analysis, we exploit the predictive power of each user's sentiment dynamics. Our results show that, with respective feature interpretability, it is possible to predict the likelihood of a user to recommend a product or service, based solely on the message-wise sentiment evolution of their CS conversations in a fully automated way.
translated by 谷歌翻译
We present the GPry algorithm for fast Bayesian inference of general (non-Gaussian) posteriors with a moderate number of parameters. GPry does not need any pre-training, special hardware such as GPUs, and is intended as a drop-in replacement for traditional Monte Carlo methods for Bayesian inference. Our algorithm is based on generating a Gaussian Process surrogate model of the log-posterior, aided by a Support Vector Machine classifier that excludes extreme or non-finite values. An active learning scheme allows us to reduce the number of required posterior evaluations by two orders of magnitude compared to traditional Monte Carlo inference. Our algorithm allows for parallel evaluations of the posterior at optimal locations, further reducing wall-clock times. We significantly improve performance using properties of the posterior in our active learning scheme and for the definition of the GP prior. In particular we account for the expected dynamical range of the posterior in different dimensionalities. We test our model against a number of synthetic and cosmological examples. GPry outperforms traditional Monte Carlo methods when the evaluation time of the likelihood (or the calculation of theoretical observables) is of the order of seconds; for evaluation times of over a minute it can perform inference in days that would take months using traditional methods. GPry is distributed as an open source Python package (pip install gpry) and can also be found at https://github.com/jonaselgammal/GPry.
translated by 谷歌翻译
我们提出了一种基于情节知识图(EKG)的新方法,用于评估开放域中的(多模式)对话剂。该图是通过解释对话过程中的原始信号而生成的,并且能够随着时间的推移捕获知识的积累。我们应用对所得图的结构和语义分析,并将这些属性转化为定性措施。我们将这些措施与通常用于对话代理的现有自动和手动评估指标进行比较。我们的结果表明,我们的基于知识的评估为互动和代理人的行为提供了更多的定性见解。
translated by 谷歌翻译
本文描述了我们对第9届论证挖掘研讨会共同任务的贡献(2022)。我们的方法使用大型语言模型来进行论证质量预测的任务。我们使用GPT-3进行及时的工程,并研究培训范式多任务学习,对比度学习和中任务培训。我们发现混合预测设置优于单个模型。提示GPT-3最适合预测论点有效性,而论证新颖性最好通过使用所有三个训练范式训练的模型来估算。
translated by 谷歌翻译
可以通过玩游戏来训练代理商来回答困难的数学问题吗?我们考虑了整数可行性问题,这是决定线性方程和不平等系统是否具有具有整数值的解决方案的挑战。对于许多数学和计算机科学领域的应用,这是一个著名的NP完整问题。我们的论文描述了一个新颖的代数增强学习框架,该框架使代理商可以玩相当于整数可行性问题的游戏。我们解释了如何将整数可行性问题转换为具有固定保证金总和的一组阵列的游戏。游戏从初始状态(数组)开始,并采取法律举措使利润率保持不变,我们的目标是最终与零位置的零位置达到胜利状态。为了赢得比赛,玩家必须在初始状态和最终终端获胜状态之间找到一条路径。找到这样的获胜状态等同于解决整数可行性问题。关键代数成分是“基础轴向运输polyhedron的曲折理想的基础”。gr \'obner可以看作是游戏的一组连接移动(动作)。然后,我们提出了一种新型的RL方法,该方法训练代理以预测连续空间中的移动,以应对较大的动作空间。然后将连续的移动投射到一组法律移动上,以使该路径始终导致有效状态。作为概念的证明,我们在实验中证明了我们的代理商可以很好地发挥我们最简单的游戏版本,用于2向表。我们的工作突出了培训代理商通过当代机器学习方法来训练代理商玩游戏的潜力来解决非平凡的数学查询的潜力。
translated by 谷歌翻译
语言模型(LMS)已被证明在各种下游应用程序中很有用,例如摘要,翻译,问答和文本分类。由于它们可以存储的大量信息,LMS正在成为人工智能中越来越重要的工具。在这项工作中,我们提出了道具(提示为探测),该道具利用GPT-3(最初由OpenAI在2020年提出的大型语言模型)来执行知识基础构建任务(KBC)。 Prop实施了一种多步骤方法,该方法结合了各种提示技术来实现这一目标。我们的结果表明,手动提示策划是必不可少的,必须鼓励LM给出可变长度的答案集,特别是包括空的答案集,True/False问题是提高LM生成的建议精度的有用设备。 LM的大小是至关重要的因素,并且实体字典别名提高了LM评分。我们的评估研究表明,这些提出的技术可以大大提高最终预测的质量:Prop赢得了LM-KBC竞争的轨道2,表现优于基线36.4个百分点。我们的实施可在https://github.com/hemile/iswc-challenge上获得。
translated by 谷歌翻译
考虑到大量未标记的语音数据和高标签成本,无监督的学习方法对于更好的系统开发至关重要。最成功的方法之一是对比度的自我监督方法,这些方法需要负采样:采样替代样品与当前样品(锚)对比。但是,很难确保所有负样本属于与没有标签的锚类别不同的​​类别。本文在未标记的语音语料库上应用了一种非对抗性的自我监督学习方法来学习话语级的嵌入。我们使用没有标签的蒸馏(Dino),在计算机视觉中提出,并将其改编为语音域。与对比度方法不同,Dino不需要负采样。这些嵌入是根据说话者验证和情感识别评估的。在说话者验证中,无监督的恐龙与余弦评分嵌入了voxceleb1测试试验中的4.38%EER。这表现优于最佳的对比度自我监督方法,而EER中的相对相对40%。不需要扬声器标签的迭代伪标记训练管道将EER进一步提高到1.89%。在情感识别中,Iemocap,Crema-D和MSP播客的Micro-F1得分分别进行了60.87、79.21和56.98%的恐龙。结果暗示着恐龙嵌入到不同语音应用中的普遍性。
translated by 谷歌翻译